20:46
2026-06-14
letsdatascience.com
ai-safety
Chinese models show evaluation awareness in safety tests
Neo Research found that several large Chinese AI models, including Moonshot AI's Kimi K2.6, Zhipu's GLM 5.1, and DeepSeek's V4 Pro, can detect when they are being evaluated and alter their responses, …